AITopics | Foster City

Collaborating Authors

Foster City

Empowering Time Series Forecasting with LLM-Agents

Yeh, Chin-Chia Michael, Lai, Vivian, Saini, Uday Singh, Fan, Xiran, Fan, Yujie, Wang, Junpeng, Dai, Xin, Zheng, Yan

arXiv.org Artificial IntelligenceNov-27-2025

Large Language Model (LLM) powered agents have emerged as effective planners for Automated Machine Learning (AutoML) systems. While most existing AutoML approaches focus on automating feature engineering and model architecture search, recent studies in time series forecasting suggest that lightweight models can often achieve state-of-the-art performance. This observation led us to explore improving data quality, rather than model architecture, as a potentially fruitful direction for AutoML on time series data. We propose DCATS, a Data-Centric Agent for Time Series. DCATS leverages metadata accompanying time series to clean data while optimizing forecasting performance. We evaluated DCATS using four time series forecasting models on a large-scale traffic volume forecasting dataset. Results demonstrate that DCATS achieves an average 6% error reduction across all tested models and time horizons, highlighting the potential of data-centric approaches in AutoML for time series forecasting.

data mining, large language model, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2508.04231

Country:

North America > United States > California > San Mateo County > Foster City (0.05)
North America > United States > California > Santa Clara County > Campbell (0.04)
North America > United States > California > Santa Clara County > San Jose (0.04)
North America > United States > California > San Mateo County > San Mateo (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Transportation (0.71)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

TREASURE: A Transformer-Based Foundation Model for High-Volume Transaction Understanding

Yeh, Chin-Chia Michael, Saini, Uday Singh, Dai, Xin, Fan, Xiran, Jain, Shubham, Fan, Yujie, Sun, Jiarui, Wang, Junpeng, Pan, Menghai, Dou, Yingtong, Chen, Yuzhong, Rakesh, Vineeth, Wang, Liang, Zheng, Yan, Das, Mahashweta

arXiv.org Artificial IntelligenceNov-27-2025

Payment networks form the backbone of modern commerce, generating high volumes of transaction records from daily activities. Properly modeling this data can enable applications such as abnormal behavior detection and consumer-level insights for hyper-personalized experiences, ultimately improving people's lives. In this paper, we present TREASURE, TRansformer Engine As Scalable Universal transaction Representation Encoder, a multipurpose transformer-based foundation model specifically designed for transaction data. The model simultaneously captures both consumer behavior and payment network signals (such as response codes and system flags), providing comprehensive information necessary for applications like accurate recommendation systems and abnormal behavior detection. Verified with industry-grade datasets, TREASURE features three key capabilities: 1) an input module with dedicated sub-modules for static and dynamic attributes, enabling more efficient training and inference; 2) an efficient and effective training paradigm for predicting high-cardinality categorical attributes; and 3) demonstrated effectiveness as both a standalone model that increases abnormal behavior detection performance by 111% over production systems and an embedding provider that enhances recommendation models by 104%. We present key insights from extensive ablation studies, benchmarks against production models, and case studies, highlighting valuable knowledge gained from developing TREASURE.

large language model, machine learning, treasure, (20 more...)

arXiv.org Artificial Intelligence

2511.19693

Country:

North America > United States > California > San Mateo County > Foster City (0.40)
North America > Canada (0.04)
North America > Puerto Rico (0.04)
(2 more...)

Genre: Research Report (0.82)

Industry:

Information Technology (0.93)
Banking & Finance (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Möbius Transformation for Fast Inner Product Search on Graph

Zhixin Zhou, Shulong Tan, Zhaozhuo Xu, Ping Li

Neural Information Processing SystemsNov-15-2025, 20:53:24 GMT

Nearest Neighbor (ANN) search methods may not perform efficiently in the non-metric similarity measure.

data mining, information retrieval, machine learning, (21 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Long Beach (0.04)
North America > Canada > Quebec > Montreal (0.04)
North America > United States > Texas > Dallas County > Dallas (0.04)
(16 more...)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
(2 more...)

Add feedback

Möbius Transformation for Fast Inner Product Search on Graph

Zhixin Zhou, Shulong Tan, Zhaozhuo Xu, Ping Li

Neural Information Processing SystemsOct-2-2025, 02:41:39 GMT

Neural Information Processing Systems http://nips.cc/

data mining, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Long Beach (0.04)
North America > Canada > Quebec > Montreal (0.04)
North America > United States > Texas > Dallas County > Dallas (0.04)
(16 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining (0.94)
(2 more...)

Add feedback

Here Come the Robotaxis: Zoox and Lyft Both Launch Driverless Ride Sharing

WIREDSep-10-2025, 13:00:00 GMT

Two new self-driving car services--one in Atlanta from Lyft and May Mobility, another in Las Vegas from Amazon subsidiary Zoox--prove that the robotaxi race is still on. Now comes the hard part. Today, two robotaxi firms operating on opposite sides of the US say they're opening their services to the public. The Ann Arbor tech developer May Mobility has launched its self-driving car service on the Lyft app in a section of Atlanta, Georgia. Starting today, anyone who orders a Lyft in the area might be paired with an autonomous vehicle.

artificial intelligence, robotaxis, vehicle, (15 more...)

WIRED

Country:

North America > United States > Nevada > Clark County > Las Vegas (0.27)
North America > United States > Georgia > Fulton County > Atlanta (0.24)
North America > United States > California > San Francisco County > San Francisco (0.07)
(9 more...)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)

Add feedback

Fast and Accurate Collision Probability Estimation for Autonomous Vehicles using Adaptive Sigma-Point Sampling

Cossette, Charles Champagne, Clawson, Taylor Scott, Feit, Andrew

arXiv.org Artificial IntelligenceJul-9-2025

A novel algorithm is presented for the estimation of collision probabilities between dynamic objects with uncertain trajectories, where the trajectories are given as a sequence of poses with Gaussian distributions. W e propose an adaptive sigma-point sampling scheme, which ultimately produces a fast, simple algorithm capable of estimating the collision probability with a median error of 3.5%, and a median runtime of 0.21ms, when measured on an Intel Xeon Gold 6226R Processor . Importantly, the algorithm explicitly accounts for the collision probability's temporal dependence, which is often neglected in prior work and otherwise leads to an overestimation of the collision probability. Finally, the method is tested on a diverse set of relevant real-world scenarios, consisting of 400 6-second snippets of autonomous vehicle logs, where the accuracy and latency is rigorously evaluated.

artificial intelligence, collision probability, probability, (15 more...)

arXiv.org Artificial Intelligence

2507.06149

Country:

North America > United States > California > San Mateo County > Foster City (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)

Add feedback

Robust sensor fusion against on-vehicle sensor staleness

Fan, Meng, Zuo, Yifan, Blaes, Patrick, Montgomery, Harley, Das, Subhasis

arXiv.org Artificial IntelligenceJun-9-2025

Sensor fusion is crucial for a performant and robust Perception system in autonomous vehicles, but sensor staleness--where data from different sensors arrives with varying delays--poses significant challenges. T emporal misalignment between sensor modalities leads to inconsistent object state estimates, severely degrading the quality of trajectory predictions that are critical for safety. W e present a novel and model-agnostic approach to address this problem via (1) a per-point timestamp offset feature (for LiDAR and radar both relative to camera) that enables fine-grained temporal awareness in sensor fusion, and (2) a data augmentation strategy that simulates realistic sensor staleness patterns observed in deployed vehicles. Our method is integrated into a perspective-view detection model that consumes sensor data from multiple LiDARs, radars and cameras. W e demonstrate that while a conventional model shows significant regressions when one sensor modality is stale, our approach reaches consistently good performance across both synchronized and stale conditions.

artificial intelligence, information fusion, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2506.0578

Country: North America > United States > California > San Mateo County > Foster City (0.04)

Genre: Research Report (0.64)

Industry: Leisure & Entertainment > Games > Chess (0.40)

Technology:

Information Technology > Sensing and Signal Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.95)
(3 more...)

Add feedback

The Influence of Text Variation on User Engagement in Cross-Platform Content Sharing

Hu, Yibo, Jin, Yiqiao, Ye, Meng, Divakaran, Ajay, Kumar, Srijan

arXiv.org Artificial IntelligenceMay-8-2025

In today's cross-platform social media landscape, understanding factors that drive engagement for multimodal content, especially text paired with visuals, remains complex. This study investigates how rewriting Reddit post titles adapted from YouTube video titles affects user engagement. First, we build and analyze a large dataset of Reddit posts sharing YouTube videos, revealing that 21% of post titles are minimally modified. Statistical analysis demonstrates that title rewrites measurably improve engagement. Second, we design a controlled, multi-phase experiment to rigorously isolate the effects of textual variations by neutralizing confounding factors like video popularity, timing, and community norms. Comprehensive statistical tests reveal that effective title rewrites tend to feature emotional resonance, lexical richness, and alignment with community-specific norms. Lastly, pairwise ranking prediction experiments using a fine-tuned BERT classifier achieves 74% accuracy, significantly outperforming near-random baselines, including GPT-4o. These results validate that our controlled dataset effectively minimizes confounding effects, allowing advanced models to both learn and demonstrate the impact of textual features on engagement. By bridging quantitative rigor with qualitative insights, this study uncovers engagement dynamics and offers a robust framework for future cross-platform, multimodal content strategies.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2505.03769

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > Texas > Travis County > Austin (0.14)
North America > United States > California > San Mateo County > Foster City (0.04)
(12 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Research Report > Strength High (0.68)

Industry:

Media > News (1.00)
Health & Medicine (1.00)
Information Technology (0.93)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

CATS: Mitigating Correlation Shift for Multivariate Time Series Classification

Lin, Xiao, Zeng, Zhichen, Wei, Tianxin, Liu, Zhining, chen, Yuzhong, Tong, Hanghang

arXiv.org Machine LearningApr-5-2025

Unsupervised Domain Adaptation (UDA) leverages labeled source data to train models for unlabeled target data. Given the prevalence of multivariate time series (MTS) data across various domains, the UDA task for MTS classification has emerged as a critical challenge. However, for MTS data, correlations between variables often vary across domains, whereas most existing UDA works for MTS classification have overlooked this essential characteristic. To bridge this gap, we introduce a novel domain shift, {\em correlation shift}, measuring domain differences in multivariate correlation. To mitigate correlation shift, we propose a scalable and parameter-efficient \underline{C}orrelation \underline{A}dapter for M\underline{TS} (CATS). Designed as a plug-and-play technique compatible with various Transformer variants, CATS employs temporal convolution to capture local temporal patterns and a graph attention module to model the changing multivariate correlation. The adapter reweights the target correlations to align the source correlations with a theoretically guaranteed precision. A correlation alignment loss is further proposed to mitigate correlation shift, bypassing the alignment challenge from the non-i.i.d. nature of MTS data. Extensive experiments on four real-world datasets demonstrate that (1) compared with vanilla Transformer-based models, CATS increases over $10\%$ average accuracy while only adding around $1\%$ parameters, and (2) all Transformer variants equipped with CATS either reach or surpass state-of-the-art baselines.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

2504.04283

Country:

Europe > Netherlands > North Holland > Amsterdam (0.04)
North America > United States > Illinois > Champaign County > Urbana (0.04)
North America > United States > Illinois > Champaign County > Champaign (0.04)
(3 more...)

Genre: Research Report (1.00)

Industry: Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Genetic Algorithm with Border Trades (GAB)

Lyu, Qingchuan

arXiv.org Artificial IntelligenceFeb-5-2025

This paper introduces a novel approach to improving Genetic Algorithms (GA) in large or complex problem spaces by incorporating new chromosome patterns in the breeding process through border trade activities. These strategies increase chromosome diversity, preventing premature convergence and enhancing the GA's ability to explore the solution space more effectively. Empirical evidence demonstrates significant improvements in convergence behavior. This approach offers a promising pathway to addressing challenges in optimizing large or complex problem domains.

artificial intelligence, evolutionary algorithm, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2501.18184

Country:

North America > United States > Illinois (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
North America > United States > California > San Mateo County > San Mateo (0.04)
North America > United States > California > San Mateo County > Foster City (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)

Add feedback